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FAST METHOD FOR THE FORWARD AND INVERSE MDCT IN AUDIO CODING 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This patent application claims the benefit of the filing date of United States Provisional 
5 Patent Application Serial Nos. 60/181,271, filed February 9, 2000 and entitled "Fast Method 
for the Forward and Inverse MDCT in Audio Coding"; and 60/184,685, filed February 24, 
2000 and entitled "Fast Method for the Forward and Inverse MDCT in Audio Coding", the 
entire contents of which are hereby expressly incorporated by reference. 

1 0 FIELD OF THE INVENTION 

The present invention relates to audio and image data compression and decompression 
applications. More specifically, the invention relates to a system and method for fast 
computation of modified discrete cosine transform (MDCT) and its inverse modified discrete 
cosine transform (IMDCT). 

15 

BACKGROUND OF THE INVENTION 

Discrete cosine transform (DCT) is used extensively in data compression for image and 
speech signals. For example, the authors in N. Amed, T. Natarajan, and K. R. Rao, "Discrete 
Cosine Transform," IEEE Trans. Commun., vol. COM-23, pp. 90-93, Jan. 1974 [1]; and Man 

20 IK Chao and Sang UK Lee, "DCT methods for VLSI parallel Implementations," IEEE 
Trans. On Acoustics, Speech and Signal Processing, vol. 38, no. 1, pp. 121-127, January 
1990 [2], the contents of which are hereby incorporated by reference, describe DCT methods 
for data compression. Many methods have been proposed to compute the DCT. For 
example, see Nam IR Cho and Sang UK Lee, "DCT Methods for VLSI Parallel 

25 Implementations, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol.38., 
no.l, pp.121-127, January 1990 [11]; and T. K. Truong, I. S. Reed, I. S. Hsu, H. C. Shyu, and 
H. M. Sho, "A Pipelined Design of a Fast Prime Factor DFT on a Finite Field," IEEE Trans. 
Computers, vol. 37, pp. 266-273, Mar. 1988 [12], the contents of which are hereby 
incorporated by reference. 
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Recently, Kek in C. W. Kok, "Fast Method for Computing Discrete Cosine Transform ," 
IEEE Trans, on Signal Processing, vol. 45, no. 3, pp. 757-760, March 1997 [3], the contents 
of which are hereby incorporated by reference, suggested a new DCT method for computing 
the DCT of length N = n^N 2 , where Ni is an even number and N 2 is an odd number. In 
5 this method, the DCT with even length N is decomposed into two DCT's with length N/2 . 
If N 1 2 is an odd number. 

Further, it follows from Michael T. Heideman, "Computation of an Odd Length DCT 
from a Real- Value DFT of the Same Length," IEEE trans, on Signal Processing, vol. 40, no. 
1, pp. 54-61, January 1992 [4], the contents of which are hereby incorporated by reference, 
10 that such a DCT of length N 12 can be computed by the use of the identical length discrete 
Fourier transform (DFT) method developed by Winograd. This method is described in Dean 
P. Kolba and Thomas W. Parks, "A Prime Factor FFT Method Using High-Speed 
Convolution," IEEE Trans on Acoustics, Speech and Signal processing, vol. ASSP-25, no. 4, 
pp. 281-294, August 1977 [5]; and S. Winogard, "On Computing the Discrete Fourier 
15 Transform," Mathematics of Computation, vol. 32, no. 141, pp. 175-199, January 1978. [6], 
the contents of which are hereby incorporated by reference. 

By using the ideas in [3, and 4], a fast method can be developed to compute an 18-point 
DCT. In other words, the standard 18-point DCT is decomposed first into two 9-point DCT. 
Then, such a 9-point DCT can be implemented by Winograd's DFT method. However, 
20 computing the MDCT and its IMDCT of an even length sequence can involve an extensive 
computation. 

Therefore, there is a need for a fast method to compute either the MDCT or IMDCT so 
that it can be implemented easily on either a processor, such as a digital signal processor 
(DSP) or computer. Most recently, the author in K. Konstantinides, "Fast Subband Filtering 
25 in MPEG Audio Coding," IEEE Signal Processing Letters, vol. 1, no. 2, pp. 26-28, Feb. 
1994 [7], the contents of which are hereby incorporated by reference, suggested that the 
MDCT and IMDCT in MPEG audio coding can be converted first into the standard DCT and 
IDCT, respectively. Such a DCT or IDCT can be computed rapidly by the use of Lee's fast 
method in Byeong Gi Lee, "A New Method to Compute the Discrete Cosine Transform," 
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IEEE Trans on Acoustics, Speech and Signal processing, vol ASSP-32, no. 6, pp. 1243- 
1245, Dec. 1984 [8], the contents of which are hereby incorporated by reference. 

SUMMARY OF THE INVENTION 
5 The modified discrete cosine transform (MDCT) and its inverse modified discrete 

cosine transform (IMDCT) can involve an extensive computation, for example, in layer III of 
the MPEG audio coding standard. The present invention computes an 18-point DCT in a 
fast and efficient manner. Furthermore, using this 18-point DCT method, two new methods 
are developed to compute the MDCT and its IMDCT, respectively. The number of 
10 multiplications and additions needed to implement both of these two new methods are 
reduced substantially. 

In one aspect, the invention describes a method performed by a computer for computing 
modified discrete cosine transfer comprising the steps of: 

/7 , \[-y{26-k)-y{21 + k)yb k for 0<£<8 
computing x(k) = < k 

\[y(k-9)-y(26-k)]-b k for 9<*<17" 

17 

1 5 computing 7» = £ x(k) cos[— (2k + l)n] for 0 < n < 1 7 ; 

*=o 36 

defining 7(0) = 7'(0) 12 ; and computing Y(n) = Y'(n) - Y(n - 1) for 1 < n < 1 7 . 

In another aspect, the invention describes an MPEG encoder/decoder comprising: 

f rr^ f[~X26 -k)- y(21 + k)] ■ b k for 0<*<8 

means for computing x(k) = < 

\[y(k-9)-y(26-k)]-b k for 9<£<17'' 

17 

means for computing Y' (n) = £ x(k) cos[ — (2k + l)n] for 0 < n < 1 7 ; 

k=o 36 

20 means for defining 7(0) = 7'(0) / 2 ; and means for computing Y(n) = Y'(n) - Y(n - 1) for 
1<«<17. 
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The encoder/decoder may also comprise of: means for computing Y'{k) = Y{k) ■ b k for 
0<£<17 : 



means 



17 

for computing y"(n) = J^Y'(k) cos[ (2k + l)n] for 0 < n < 17 ; 

*=o 2*18 



means for computing y'(n) = 



y m (n + 9) for 

0 for 

-y"'{21-n) for 

-/"(«- 27) for 



0<«<8 
n = 9 

10<«<26 ' 
27<w<35 



18-1 

5 means for defining y(0) = Y, Y ( k ) ' c k ' and means for computing y(n) = y\n) - y(n - 1) 
for 1 < n < 35 . 



BRIEF DESCRIPTION OF THE DRAWINGS 

The objects, advantages and features of this invention will become more apparent from 
1 0 a consideration of the following detailed description and the drawings, in which: 

FIG. 1 is an exemplary hardware butterfly diagram for a fast 18-18 DCT method; 
FIG. 2 is an exemplary hardware butterfly diagram for a fast 36-18 MDCT method; and 
FIG. 3 is an exemplary hardware butterfly diagram for fast 18-36 IMDCT described. 

15 DEATILED DESCRIPTION 

The system and method of the present invention computes the MDCT and its IMDCT 
respectively with a substantially reduced number of multiplications and additions. 
Additionally, a computer simulation shows that the speed of these two new methods for 
computing the MDCT and IMDCT are 7.2 times and 4.3 times faster than the direct 

20 methods, respectively. The present invention significantly improves the performance and 
effectiveness of data compression methods such as, MP III, MPEG, and other audio/video 
compression methods. By this means, the MDCT or IMDCT defined in Hwang-Cheng 



4 



41557/RRT/C694 

Chiang and Jie-Cherng Liu, "Regressive Implementation for the Forward and Inverse MDCT 
in MPEG Audio Coding, " IEEE Signal Processing Letters, vol.3, no.4, pp.1 16-1 17, April 
1996 [10], the contents of which are hereby incorporated by reference, can be converted first 
into the standard DCT of length TV = 18. Then such an 18-point DCT can be implemented 
5 by using the same length Winograd's DFT method. 

The advantage of this new method over the previous methods is that the 18-point DCT 
can be used to compute both the MDCT and IMDCT simultaneously. As a consequence, the 
computation of the 18-point IDCT needed to perform the MDCT method developed in Hsieh 
S. Hou, "A Fast Recursive Method for Computing the Discrete Cosine Transform," IEEE 

10 Trans on Acoustics, Speech and Signal processing, vol. ASSP-35, no. 10, pp. 1455-1461, 
Oct. 1987 [9], the contents of which are hereby incorporated by reference, is completely 
avoided in this new MDCT method. With these new techniques, the new MDCT and 
IMDCT methods require only a small fraction of the number of multiplications and additions 
that are required in direct methods, respectively. 

15 Fast Method for Computing the 18-point DCT 

Let x(k) be a sequence with length TV = 18 . The DCT of x(k) defined in [3] is given by 



In (1), X(n) can be decomposed into the even and odd indexed output of the DCT. That is, 




for 0<n<l7 



(1) 



A(n) = X(2n) 



for 0 < n < 8 



(2) 



B(n) = X{2n + 1) 



for 0 < n < 8 



(3) 



By [3], (2) and (3) can be shown to be two DCFs with length 9 as fellows: 



A(n) = > a(k) cos[ (2k + \)n 



for 0 < n < 8 



(4) 



where 
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a(k) = x(k) + x{\%-\-k) (5) 

and 



= cos[— (2k + l)n] for 0<«<8 (6) 

k=o 2x9 



where 



b{k) = 2[x(Jfc) - x(l 8 - 1 - k)] cos[-^— (2k + 1)] (7) 

2x18 



B'(n) = B(n) + B(n-l) for 0<«<8 ( 8 ) 

Note that 5(0) = B(-l) . From (8), one obtains 

W> = ^ (9) 

Using the sequence B'(n) obtained by (6), B(n) for 0 < n < 8 can be obtained by the use 
5 of both (9) and the recursive form in (8). 

Since A(n) in (4) is a 9-point DCT, then, it is shown in [5] that this DCT can be 
computed by the use of the identical-length DFT method developed by Winograd [5,6]. To 
illustrate this, first, the decomposition of (4) into two even and odd values of n of the DCT 
appears as follows: 

9-1 

A(2n) - £a(£)cos[ (2k + 1)2*] for 0 < * < 4 (10) 

*=o 2x9 

10 and 

9-1 

^(9-2«) = £a(£)cos[ (2£ + l)(9-2n)] for 0<n<4 (11) 

*=o 2x9 
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It follows from [4] that (10) and (11) can be shown to the following: 

A(2n) = (-\)"Ke{£a'(k) (D " k } (12) 

and 

A(9 - In) = (- if Im{g a'(* W*} (13) 

k=0 

— th 

where co = e 9 is the 9 root of unity and a\k) is defined by 

\af(-l) k+6 k + 4) for 0<k<4 



(14) 



a'(k) = < 

[«((-l)* +6 (9-*) + 4) for 5<k<% (15) 
Define the 9-point DFT as follows: 

A'(n) = %Xk)o)* (16) 

5 It is shown in [4] that the transform in (16) can be computed with a minimum number of 

operations by Winograd's method. The substitution of (16) into (14) and (13) yields 
A(2n) = (-l)"Re{A\n)} for 0<«<4 (17) 

A(9-2n) = (-l) n+] Im{A'(n)} for 5<n<8 (18) 

where A\n) is given in (16). 

The detailed method given in [5] for computing the 9-point DCT in (4) is given in 

Appendix A. One observes from this method that the number of multiplications and 
10 additions are 10 and 34, respectively. In a similar fashion, by replacing a{k) and A(n) by 

b(k) and Z?' (n) respectively, (6) can also be computed by the use of the method given in 

Appendix A. 
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Let x(k) be a sequence of length 18. The fast method for computing the 18-point DCT 
defined in (1) comprises the following three steps: 

(1) . Compute a(k) and b(k) from (5) and (7). That is, 

a(*) = jc(*) + jc(18-l-fc) for 0 < A: < 8 

5 6(Jt) = 2(jc(Jt)-jc(18-l«ifc))cos[— (2* + l)] for 0<£<8. 

36 

(2) . Use the fast method given in Appendix A to compute the 9-point DCT of a(k) , i.e. 

A{n) in (4) and the 9-point DCT of b(k) , i.e., E (n) in (6) for 0 < n < 8 . 

(3) . Use both the recursive formulae for 1 < n < 8 in (8) and (9) to compute B(ri) from 

the known B l (n) for 0</?<8. That is, fi(0) = 5'(0)/2, and 
10 B(n) = B\n) -B(n-l) for 1 < n < 8 . Then set X(2n) = A(n) , for 0 < « < 8 And 

X(2n + 1) = for 0 < » < 8 , where X{n) is a 18-point DCT of x(Jfc) given 

in(l). 

From the method described above, it can be shown that the total number of 
15 multiplications and additions needed to compute (1) are 29 and 97, respectively. In contrast, 
324 multiplications and 306 additions are required for a direct computation of (1). An 
exemplary butterfly diagram of a fast 18-18 DCT is depicted in FIG. 1. FIG. 1 shows an 
exemplary hardware butterfly diagram for the fast 18-18 DCT method. The button diagrams 
indicate the computation flows. The pre-computed constants are 

20 d k =2cos[— (2*+l)] for 0<£<8. 
36 
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Fast method for Computing the MDCT 

Let y(k) be a sequence of length N = 36, the MDCT of y(k) defined in [10] is given by 

W-Z^)cos[^-(2^ + l + 18)(2n + l)] for 0<«<17 (19) 

2*36 

Instead of computing (19), an alternating technique is employed. To see this, replacing k by 
36-1- k in (19), one obtains 



Y(n) = f>(36 - 1 - *) cos[-^-((2 * 36)(2« + 1) - (2k + 1 - 1 8)(2» + 1))] (20) 
It=o 2*36 

5 But 

cos[-^— ((2 * 36(2n + 1) - (2* + 1 - 18)(2w + 1))] = -cos[— ^— (2k + 1 - 18)(2« + 1)] 
2*36 2*36 

Thus, (19) becomes 

35 _ 

Y( n ) = -Yy(36-l-k)cos[-—- (2* + 1-1 8)(2w + l)] for 0 <« < 17 (21) 
t^o 2*36 



If 

Y'(n) = Y(n) + Y(n-l) for 0 <« < 17 (22) 

then 

'(«) = f> Wcos[-£-(2* +1-1 8X2»)] (23a) 
^ 2*36 

10 where 

j>'(*) = -2;K36-l-*)cos[— ^— (2Jfc+l-18)] for 0<Jt<35 (23b) 

2*36 
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The substitution of (21) into (22) yields, 

36-1 jr 

T (n) = - £ y(36 - 1 - k) cos[-£- (2* + 1 - 1 8)(2» + 1)] 

36-1 „ 

- j>(36 - 1 - *) cos[-— (2* + 1 - 1 8X2« - 1)] 

w 36 (24) 

= - Y y(36 - 1 - k) cos[— ^- ((2k + 1-1 8)(2n) +(2k + 1-1 8))] 
to 2*36 

- J X36 - 1 - *) cos[^- ((2* + 1-1 8X2«) -(2* + 1-1 8))] 

*=o 2 36 

(24) can be shown as 



r(») = -|;2^(36-l-*)cos[-^-(2* + l-18)]cos[-^-(2fc + l-18X2/i)] 

ti 2*36 2*36 ^ 

= £y(*)cos[^(2* + l-18>i] 

*5 36 



where y '(k ) is given in (23b). 

From (21), one observes that 7(0) = Y(-l) . Using this result and the recursive formula 
in (22), one yields 

7(0) = F(0)/2 (26a) 
Y(n) = T(n)-Y(n-\) for 1 < « < 17 (26b) 

Using the sequence y'(n) obtained by (25b), the sequence y(ri) for 0<r<17 can be 

obtained by the use of (26a) and (26b). 

Let 
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_(y'(k + 9) for Q<k<26 (27a) 

y = b' (* " 27) for 27<&<35 (27b) 

Then 

r(») = iy'(*)cos£(2* + l)ii] (28) 

The proof of this is similar to that used in Lemma 1 in [7]. To prove this, (25) can be 
rewritten as: 

T (/I) = J y (*) cos[£ (2* + 1 - 1 8)«] +£ y (k) cos[^ (2* + 1 - 1 S)n] (29) 

*=o 36 k=9 36 

Let k = k'+9 , (29) becomes 

-1 -rr 26 77- 

= X y*(£*+9)cos[— (2it , +l)n]+£y(it , +9)cos[— (2*'+l)w] (30) 

jt'=-9 36 36 

5 Also, let £ r +9 = k-21 in the first summation and let k % ~ k in the second summation of (30). 
Then 

n«)=£/(*-27)cos[^ (31) 

But 

cos[— {{2k + l)w - (2 x 36»] = cos[— (2* + \)n] 
36 36 



Thus, (31) becomes 

r {n) = f; /(w) cos[^ (2* + l)n] (32) 

36 

where y\k) is given in (27a) and (27b). 
10 For given y"(k) as defined in (27a) and (27b), 
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X(n) = F(n) = J]x(k)cos[— (2* + l)/i] for0<^<17 (33) 
where 

x(£) = y\k) + /(36 - 1 - k) for 0<£<1 (34) 
The proof of this is also similar to that used in Lemma 2 in [7]. To see this, (32) can be 
rewritten as 

r(n) = Y l l (n) + Y 2 \n) for 0<«<1 (35) 
where 

Y x '(n) = f J y" (*) cos[^ {2k + \)n] (36) 



5 and 



JV(»)= £y'(*)cos[^(2A: + l)»] (37) 
*tig 36 

Let A: = 36 - 1 - . Then Y 2 ' (n) in (37) becomes 

*V(*)= ty*(36-l-A:')cos[^(2(36-l-r) + l>] (38) 

A'=17 30 

However, 

cos — (2 x 36« - (2k' + 1)») = cos — (2k' + Y)n 
36 36 

Hence, let k'= k , (38) becomes 

^2 ' O) = Z y (36 - 1 - k') cos — (2k' + \)n (39) 
*'=o 36 

The substitution (39) and (36) into (35) yields 
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17 „ 17 



' (n) = J> "(* )cos[^(2£ + l)n] + J> "(36 - 1 - k)cos[-^-(2k + 1)»] 
: jTx(jfc)cos[— (2* +1)«] for0<w<17 



36 

where x(k) is given in (33). From the above, it can be seen that (33) is an 18-point DCT 
which can be efficiently computed by the use of the fast method given in the previous 
section. 

Suppose that y(k) is a sequence of length N = 36. For 0 < k < 8 , one has 
27 < 36 - 1 - k < 35 . Thus, from (27) and (23b), (34) becomes 

x(k) = y"(k) + y\36-l-k) 
= y'(k + 9) + y'(8-k) 

= -2 y(26 - k) cos[-?- (2k + 1)] - 2^(27 + k) cos[-^- (2k + 1)] 
2x36 2x36 

= [-X26 -k)- y(27 + k)]-2 cos[-^— (2k + 1)] 

2x36 

Similar, for 9 < A < 17 , (34) becomes 

x(k) = y'(k + 9) + y'(AA-k) 

= -2X26 - *) cos[-^- (2* + 1)] - 2 j<* - 9) cos[-^- (7 1 - 2*)] 
2x36 2x36 

= [y(k -9)-y(26- k)} ■ 2 cos[^ (2k + 1)] 

since cos[ — — — (71 - 2k)] = -cos[ — - — (2k + 1)] . 
L 2x36 2x36 

71 

Let b k = 2cos[ (2& + 1)] be the pre-computed constants. The fast method for 

2x36 

computing the MDCT of y(k) in (19) is composed of the following three successive steps of 
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computation: 

(1) . Compute 

x(k) = [[-yW-k)-y(21 + k)Yb k for 0<*<8 
\[y(k-9)-y(26-k)]-b k for 9<£<17* 

(2) . Use the fast method described in Section II to compute the 18-point DCT of x(k) , 

17 

5 i.e., Y\n) defined in (40). That is,r(rc) = £x(£)cos[— (2k + l)n] for 

0<n<\7 

(3) . Let 7(0) = r(0)/2. 

Then compute Y{n) = Y\n) - Y(n - 1) for 1 <n< 17 . 

10 This new method is simpler and faster than that of the brute-force method. The number 

of multiplications and additions needed for this method and the brute-force method is given 
in Table 1. Table 1 shows that the multiplications and additions needed to implement this 
new method require only 10.03% and 20.95% of the brute-force method, respectively. The 
butterfly diagram of the fast 36-18 MDCT is depicted in FIG. 2. FIG. 2 illustrates an 

15 exemplary hardware butterfly diagram for the above fast 36-18 MDCT method. The button 
diagram indicates the computation flow. The pre-computed constants are 

&,=2cos[-^-(2* + l)]. 

2x 36 

Fast Method for Computing the IMDCT 

Let Y{k) for 0<£<17 be the output sequence of (19). The inverse MDCT of Y(k) 
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defined in [10] is given by 

18-1 

y(n) = ^Y(k)cos[-—(2n + l^)(2k^l)] for 0<n<35 (41) 
*3 2*36 

If y (w) = y(n) + >>(w - 1) , using a proof similar to that used in (24)-(26), then (41) can be 
expressed by 

is-i - 

y(w) = yr(jfe)cos[-^— (2» + 18)(2Jt + l)] for 0<«<35 (42) 
ifco 2 *36 



where 



Y'(k) = 2Y(k)cos[-^—(2k + l)] for 0<£<17 (43) 
2*36 



5 It follows from (4 1 ) that 

18-1 

y(0) = £y(*)cos[-^(19)(2* + 1)] (44) 
to 2*36 

Using the sequence y (n) obtained by (42), again, y{n) can be obtained by the use of the 
initial condition y(0) given in (44) and the recursive formula y(n) = y 1 (n)- y{n - 1) for 
1<«<35. 
Define 



\y\n-9) for 9<w<35 
l/(« + 27) for 0<«<8 



10 From (42), (45) and (46) can be shown to the following 



7t 



(45) 
(46) 



y(w) = /(»-9) = Tr(jfc)cos[— ^— (2* + l)2»] for 9<«<35 (47) 

*=o 2*36 



15 



41557/RRT/C694 



18-1 ^ 
K 



y"(n) = y'(n + 27) = ^Y\k)cos[— — (2n + 72)(2k + \)] for 0<«<8 (48) 

k=o 2 36 



In (47), it is can be shown that y" (1 8) = 0 . Based on [7], the following can be proved: 
In (47) and (48), 

y" (18 + j) = -y" (1 8 - j) for 1 < j < 9 (49) 
y"(18 + j) = y"(\8-j) for 10 < y < 17 (50) 

Thus, for 1 < j < 9 , from (47), one has 



/'(18 + j) = 'fV' (*) cos[-f- • 2(1 8 + j\2k + 1)] 

2 * 36 



k=0 
18-1 



k=Q 
18-1 



= £rWsin[|(2^ + l)]-sin[^(2^ + l)] for \<j<9 

From (52) and (53), ^'(18 + j) = -y"(\8 - j) for 1 < j < 9 . 
For 10 < j < 17 , from (48), one has 



(51) 



= £ r (*) cos[^ (2 * 1 8)(2* + 1) + ^ (V)W + 1)] 

Using cos(^4 + B) = cos ^4 ■ cos B - sin ^4 * sin B , (5 1) becomes 

18-1 • 

/(18 + y) = -£ r (*) sin [T( 2 * + 1 )]- sfa [^( 2 * + 1 )] for i^J* 9 ( 52 ) 
Similarly, 

/(l 8 " ;) = Z 7 '(*) cos[-^- 2(1 8 - ./)(2* + 1)] 



(53) 
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17 



/'(I 8 - j) = £r(*) cos[-^-(2(l 8 - j) + 12)(2k + 1)] 
Ho 2*36 

= f Y\k) cos[-^-(2£ + 1) — ^-(2k + 1)] 
L 2 *36 2*18 

Again, using cos(^4 -B) = cos ^4 • cos 5 + sin A • sin B , (54) becomes 



(54) 



17 



^"( 18 " J) = g 7 W sin[y(2* + 1) .sin-£|-(2* + 1)] (55) 

However, sin — {2k + 1) = - sin y (2& + 1) . Hence, (55) becomes 

/(18 - j) = -2n*)sin[f (2* + 1)] • sin[-^-(2* + 1)] (56) 
S 2 2*18 

From (52) and (54), /(18 + j) = -y\\8- j) for 10 < j < 17 . 

It follows from the above proof that one only computes y"(ri) for 0 < n < 17 . In other 

words, for given y"(n), where 0<«<17, y"(n) for 19<«<35 can be obtained from 

using (49) and (50). Recall that /(18) = 0. 

Also, by defining 

y"'(n) = -y"(n) for 0 < j < 8 (57) 
y'\n) = y\n) for 9 < j <\1 (58) 

Then 

y>) = In^)cos[-^-(2^ + l)«] for 0<«<17 (59) 
to 2*18 

As a result, for 0 < n < 8 , from (48), one yields 
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y\n) = 2n*)cos[-^-«(2* + 1) + 7t{2k + 1)] 



k=0 



= -2n*)cos[-^-(2* + l)ii] 

= -y m (x) 



For 9 < n < 17 , from (47), one obtains y"'(n) = y\ri) . 

In (59), y'\n) is an 18-pointDCT. By replacing y'\n) and 7'(A:) by X(n) and jc(*), 
respectively, the 18-point DCT in (59) is the same as (1). As a consequence, the fast method 
developed in Section II can also be used to compute (59). 

From (45) and (46), one has 



Together with (49), (50), (57) and (58), y'(n) can be expressed in terms of y m (n) as 
follows: 

y '(n) = y\n + 9) = y'\n + 9) for 0 < n < 8 

y'(n) = y\n + 9) = y"(l 8) = 0 for n = 9 

y\n) = y\n + 9) 

= /(18 + (»-9)) 

= -j/(18-(rc-9)) for 10 < rc^l8 

= -y"(27-n) 
= -y"'(27-n) 

y '(n) = -y"'(27 -ri) for 1 9 < n < 26 




for 



27 < n < 35 



0 < n < 26 
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/(n) = -y m (n - 27) for 27 < n < 35 

Let 7(A) for 0 < A: < 17 be the output sequence of (19). Also let 

TV 

b k =2cos[ (2£ + l)] be the pre-computed constants which are the same as those 

2 x 36 

\9tt 

defined in the MDCT method in the previous section. Finally, let c k = cos[ (2k + 1)] . 

2x36 

The fast method for computing (41) is comprised of the following 4 steps of computation: 

(1) . Compute 

Y'(k) = Y(k)-b k for 0<k<\7 

(2) . Use the fast method described in Section II to compute the 18-point DCT 

of Y(k) given in (59). That is, 

y m (n) = f^YXk) cos[-^— (2k + l)n] for 0<«<17 
k=o 2 18 



(3). Compute 



y m {n + 9) for 0 < n < 8 

0 for n = 9 

-y m ill-n) for 10<n<26 

-y"'(n -27) for 27<«<35 



18-1 

(4). L*y{0) = YY{k)- Cl 

k=Q 



Then compute y(ri) = y'(n) - y(n - 1) for 1 < n < 35 . 

It is easy to observe that this new method for computing the IMDCT is simpler and 
faster than that of the brute-force method. The number of multiplications and additions 
needed for this fast method and the brute-force method is given Table 1 . Table 1 shows 
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that the number of multiplications and additions required to implement this new method are 
only 7.25% and 10.03%, respectively, of the brute-force method. The butterfly diagram of 
the fast 18-36 IMDCT is depicted in Figure 3. 
Program Implementation and Simulation Results 
5 The MDCT and its IMDCT methods described above can be computed simultaneously 

by using the 18 -point DCT. These two new methods are implemented using C++ language 
run on a Pentium 133/Windows 98 platform, in which the multiplications and additions are 
implemented using double data type. An example of the source code is given in Appendix 

B. The fast methods are verified by comparing their results of 10 6 computations to those of 
10 the bruit-force methods. The computation times are provided in Table 1, which is in 

millisecond averaged from 10 6 computations. These new methods for computing the 

MDCT and IMDCT require 0.0218 ms and 0.0375 ms as compared to 0.1567 ms and 0.1616 

ms required by the brute-force methods, respectively. 

As a consequence, the new methods for computing the MDCT and IMDCT are 7.2 and 
15 4.3 times faster than the brute-force methods, respectively. In addition to audio compression 

such as MP III method, the resulting improvement is also useful for video compression 

methods such as MPEG method. 

Additionally, the method of the present invention may be implemented in a custom 

application-specific integrated circuits (ASICs), general-purpose programmable DSP using 
20 firmware to program the DSP devices, and customizable arrays such as programmable logic 

arrays (PLAs) and field-programmable arrays (FPAs). Because the present invention uses 

the same encoding and decoding (recording and playback), the complexity of software, 

firmware, and hardware is significantly reduced resulting in cost improvement and 

flexibility, in addition to speed improvement. 
25 FIG. 3 is an exemplary hardware butterfly diagram for fast 18-36 IMDCT described. 

The button diagram indicates the computation flow. The pre-computed constants are 

b k = 2cos[-^-(2£ + l)] and 
2x 36 
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18-1 1Q-. 

y(0) = jY{k) ■ c k where c k = cos[— -(2k + 1)] . 
to 2x36 



Table I: NUMBER OF MULTIPLICATIONS AND ADDITIONS NEEDED TO 
IMPLEMENT THE MDCT AND IMDCT 





MDCT 


IMDCT 


Step 1 


Step 2 


Step 3 


Total 


Step 1 


Step 2 


Step 3,4 


Total 


New 
Method 


Multipliers 


18 


29 


0 


47 


18 


29 


18 


65 


Adders 


18 


94 


17 


129 


0 


94 


35 


129 


Brute-force 
method 


Multipliers 




648 




648 


Adders 




630 




612 
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Table II: MDCT AND IMDCT EXECUTION TIMES IN MILLI SECOND 





MDCT 


IMDCT 


New method 


0.0218 


0.0375 


Brute-force methods 


0.1567 


0.1616 



It will be recognized by those skilled in the art that various modifications may be made 
to the illustrated and other embodiments of the invention described above, without departing 
10 from the broad inventive scope thereof. It will be understood therefore that the invention is 
not limited to the particular embodiments or arrangements disclosed, but is rather intended to 
cover any changes, adaptations or modifications which are within the scope and spirit of the 
claims in the area of signal processing of audio and video signals including compression of 
audio and video signals. 
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